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The fundamental role of glycoconjugates in many biological 
processes is now well appreciated and has intensified the 
development of innovative and improved synthetic strategies. 
All areas of synthetic methodology have seen major advances 
and many complex, highly branched carbohydrates and 
glycoproteins have been prepared using solution- and/or 
solid-phase approaches. The development of an automated 
oligosaccharide synthesizer provides rapid access to 
biologically relevant compounds. These chemical approaches 
help to produce sufficient quantities of defined 
oligosaccharides for biological studies. Synthetic chemistry 
also supports an improved understanding of glycobiology and 
will eventually result in the discovery of new therapeutics. 
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Abbreviations 

GlcNAc /v-acetylglucosamine 
GPI glycosylphosphatidylinositol 
HIV human immunodeficiency virus 

PSA prostate-specific antigen 



Introduction 

In addition to oligopeptides and oligonucleotides, oligo- 
saccharides (glycans) constitute the third major class of 
naturally occurring biopolymers that play a fundamental 
role in many important biological processes. Glycans are 
commonly found in nature as glycoconjugates (glycopro- 
teins or glycolipids) that show high structural diversity, 
greatly exceeding the diversity of proteins and nucleic 
acids. 

In contrast to linear oligopeptides and oligonucleotides, 
oligosaccharides are often complex branched molecules 
and the glycan core is commonly attached to proteins and 



lipids. In nature, three major classes of glycans exist: N- 
linked glycans, 0-linked glycans and glycosylphosphati- 
dylinositol (GPI) anchors. Intensive research into the 
biological role of carbohydrates has led to an increased 
need for sufficient quantities of natural and modified 
glycoproteins; however, the isolation of carbohydrates 
from natural sources is extremely difficult owing to their 
structural complexity. Access to pure carbohydrates for 
biological, biochemical, biophysical and medicinal stud- 
ies therefore relies on chemical and enzymatic synthesis 
[1,2]. Remarkable progress has been made in this area; 
however, further innovations are required to handle the 
structural complexity of oligosaccharides. Their prepara- 
tion is technically difficult, extremely time-consuming 
and performed by a few specialized laboratories. The 
introduction of solid-phase synthesis strategies has sig- 
nificantly improved carbohydrate assembly, as an excess 
of reagent can be used to ensure high yields and to reduce 
the number of purification steps. The development of an 
automated oligosaccharide synthesizer [3*, 4*, 5**] has led 
to rapid access to complex carbohydrates of biological 
relevance. This review highlights recent advances in the 
synthesis of complex oligosaccharides and glycoproteins, 
primarily focusing on strategies published in the past two 
years. 

/V- Linked glycoproteins 

jV-Linked glycoproteins (TV-glycans) are the most abun- 
dant in nature and are commonly divided into four groups: 
high-mannose, complex, hybrid and poly-iV-acetyllacto- 
samine glycans. Although the structural details are well 
established, little is known about their structure-activity 
relationship. In iV-glycans, the oligosaccharide sidechain 
is attached to the protein via an asparagine amino acid. All 
A^-glycans share the common pentasaccharide core struc- 
ture (mannose) 3 (A^-acetylglucosamine) 2 (Man 3 GlcNAc 2 ) 
shown in Figure la. Structural diversity is generated by 
variation in the substitution pattern of the pentasacchar- 
ide core, in the degree of branching and in the terminal 
sugars. The pentasaccharide core can be extended by up 
to five antennae. The preparation of the basic structure 
contains several synthetic challenges, including branch- 
ing and the inclusion of a p-mannoside. Recently, two 
efficient partial syntheses of the core structure have been 
accomplished [6,7], selectively establishing the p-man- 
nosidic linkage. The orthogonally protected P-mannosy- 
lated chitobiose trisaccharide with a terminal azido group 
serves as a key building block in the preparation of 
complex N-glycans. The entire pentasaccharide has been 
synthesized recently by Danishefsky and colleagues [8] 
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Figure 1 
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AMJnked glycoproteins, (a) Structure of the core pentasaccharide common to all A/-glycans. The core structure can be extended by up to 
five antennae (R). (b) Structure of a prostate-specific antigen (PSA) glycopeptide. The crucial retrosynthetic steps of Danishefsky's [21*] 
strategy are shown, (c) Structure of the gp120 glycopeptide fragment, which is a possible target for an anti-HIV-vaccine. Protein sequences 
are shown using the three-letter amino acid code. 
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using Crich's p-mannosylation methodology [6] followed 
by a simultaneous di-ct-mannosylation with a thiomanno- 
side donor. 

As an alternative to these solution-phase preparations, 
the synthesis of the core pentasaccharide selectively 
functionalized with one N-acetylglucosamine residue 
has been performed recently using a solid-phase approach 
[9]. The first automated solid-phase oligosaccharide 
synthesizer [5""] has been used to efficiently prepare 
the core pentasaccharide [10] by using an octenediol 
functionalized Merrifield's resin and three different 
building blocks: two monosaccharides and one disacchar- 
ide already containing the p-mannosidic linkage. Branch- 
ing was achieved by simultaneous dimannosylation of the 
trisaccharide core. 

Innovative synthetic methods have also provided access 
to more complex and highly branched N-glycans. Weiss 
and Unverzagt [11] have developed a general strategy for 
the preparation of multiantennary Af-glycans. Crucial 
challenges in the synthesis of these sterically crowded 
bi- to tetra-antennary compounds is the sequence of 
introducing the building blocks and the steric demand 
of the building blocks. Complex biantennary iV-glycans 
are also accessible via chemoenzymatic total synthesis. 
Elongation of synthetic oligosaccharides has been per- 
formed using glycosyltransferases to give full-length 
W-glycans [12,13]. 

Synthetic oligosaccharides are useful in gaining a more 
detailed understanding of glycoprotein quality control. In 
particular, maintenance of the integrity of protein folding 
has recently received significant attention. Ito and col- 
leagues [14,15] accomplished a convergent and stereo- 
selective route to the nonasaccharide Man 8 GlcNAc 2 and 
the monoglucosylated dodecasaccharide a-Gl^MangGlc- 
NAc 2 , a putative ligand of the molecular chaperones 
calnexin and calreticurin. These synthetic oligosacchar- 
ides might serve as molecular probes to detect glycopro- 
tein-mannosidase-like protein recognition. 

Glycoproteins are also important in the context of diag- 
nostics, therapeutics and vaccines. The integration of 
oligosaccharides into glycoproteins is realized by convert- 
ing them into anomeric glycosylamines, which is either 
performed by treatment with ammonium hydrogencar- 
bonate or by reduction of anomeric glycosyl azides, and 
subsequent attachment to the peptide chain [16-18]. Guo 
and colleagues [19] attached a fucoslyated trisaccharide 
to the peptide of the CD52 antigen by using a solution- 
phase synthesis with solid-phase workup or a combined 
solution- and solid-phase approach. More complex oligo- 
saccharides containing two thiol residues were linked 
to the same peptide by 9-fluorenylmethoxycarbonyl 
(Fmoc)-based solid-phase peptide synthesis [20]. 
Because of their short peptide chain containing only 12 



amino acids, their simple glycosylation pattern and their 
interesting bioactivity, these glycopeptides serve as use- 
ful models to study structure-activity relationships. The 
development of a universal strategy [21*] for the prepara- 
tion of complex multibranched A^-acetyllactosamine-type 
glycans from common precursors has led to the first 
chemical synthesis of normal and transformed prostate- 
specific antigen (PSA) glycopeptides (Figure lb). PSA has 
been identified as a highly specific cancer marker that 
might enable the early diagnosis of prostate tumours. 

A^-Linked carbohydrates also play an important role in 
human immunodeficiency virus (HIV) retroviral patho- 
genesis. The HIV-1 surface envelope glycoprotein gpl20 
is highly glycosylated containing up to 24 //-linked high- 
mannose carbohydrates and shows biological functions in 
helper T-lymphocyte infections [22]. Seeberger and col- 
leagues [23] developed a linear solution-phase synthesis 
of a triantennary high-mannose nonasaccharide from 
gpl20 using just three monosaccharide building blocks. 
Employing a reactivity-based one-pot self-condensation 
approach, Wong and coworkers [24] prepared several 
high-mannose oligosaccharides, which efficiently inhibit 
the binding of the antibody 2G12 to gpl20. More re- 
cently, Danishefsky and colleagues [25 - *,26**] described 
the first chemical synthesis of HIV gpl20 fragments 
(Figure lc), which serve as targets for an anti-HIV 
vaccine. 

Many important glycoproteins are multiply glycosylated 
at fixed sites. Danishefsky's laboratory [27] recently dis- 
closed a convergent method for the preparation of Afunc- 
tional glycopeptides: two glycopeptides are synthesized 
separately from their glycan and peptide precursors using 
standard procedures and subsequently coupled to yield 
the bifunctional compounds. 

O-Linked glycoproteins 

A second major group of biologically important glycopro- 
teins are CMinked glycoproteins (0-gIycans). The carbo- 
hydrate residue in O-glycans is covalently attached to 
the peptide backbone via the hydroxyl group of serine, 
threonine, tyrosine, hydroxyproline, hydroxylysine or 
another hydroxylated amino acid. In contrast to N-g\y- 
cans, these glycoproteins show a higher degree of struc- 
tural diversity and do not share a common core structure. 
Additional variety arises from further carbohydrate elon- 
gations of these backbones. 

Tumour-associated antigens (Figure 2a) like the T^-, T-, 
sialyl-T N and sialyl-T antigens as well as the sialyl Lewis 
X and sialyl Lewis A antigens were first found in mucins. 
Mucins are a class of highly 0-glycosylated proteins 
present on the surface of various types of epithelial cells. 
In normal tissue, the peptide backbone carries several 
complex oligosaccharides derived from the glycan core 
structures shown in Figure 2b, which are characterized by 
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Figure 2 



(a) HO ^OH 

HO- 



(b) 



(c) 



AcHN 



HO ^OH 



Ser/Thr 



HO /OH T<" 



HO, 



T N -Antigen 




OH AcHN £ er/Thr 



T- Antigen 



HO, 




C0 2 H 



AcHN^^i/ O 



AcHN Sern-hr 



Sialyl-T N -antigen 



AcHN^U^i/ 

HO HO | 



HO- 





AcHN 



HO^OH HO OH 

OH AcHN <L ern - hr 



a-2 r 6-Sialyl-T-antigen 



cc-2,3-Sialyl-T-antigen 



HO. 



AcHN- 



OH H0 2 C HO OH OH 
TtZ? OH V 



OH 



HO JOM 



OH 



hA oh 

Sialyl Lewis X antigen 



AcHN 
~OH 




AcHN 



HO 



°"* - TcT| AcHN 

-^2Zoh 
H i OH 



Sialyl Lewis A antigen 



H ° OR 6 

AcHN' ^CHg/H 

T 

Protein^ A ^Protein 

H » 
H O 



Core 1:R 3 = p(1,3)-Gal 

Core 2: R 3 = p(1,3)-Gal; R 6 = P(1,6)-GlcNAc 

Core3:R 3 = P(1,3)-GlcNAc 

Core 4: R 3 = p(1 ,3)-GlcNAc; R 6 = p(1,6)-GlcNAc 

Core 5: R 3 = a(1,3)-Ga!NAc 

Core 6: R 6 = p(1,6)-GlcNAc 

Core 7: R 6 = ct(1,6)-GalNAc 

Core 8: R 3 = a(1,3)-Gal 



Bn k° B n <T OBn Bn ?<T OBn /OBn BnO OBn 

p. JoPiv MVU OPiv 
PjvO 



,OBn 
O 




HO 



O 

"OPiv 




I OPiv 
PivO 



Current Opinion in Biotechnology 



O- Linked glycoproteins, (a) Structures of tumour-associated carbohydrate antigens that were first discovered in mucins, (b) Core structures of 
mucin-type O-linked glycans, a class of highry O-glycosylated proteins, (c) Structure of the Le v -Le x tumour marker. Ac, acetyl; Bn, benzyl; 
Piv, pivaloyl; TCA, trichloroacetyl. 
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an ^-acetylgalactosamine unit a-O-linked to serine or 
threonine. An increased expression of mucins is usually 
prevalent in tumour cells, where the carbohydrate chains 
are modified due to incomplete glycosylation and pre- 
mature sialylation. As tumour-associated glycans with 
peptide sequences of mucins constitute a promising 
target for the development of synthetic antitumour vac- 
cines, the chemical synthesis of such glycoconjugates has 
received considerable attention and several reviews 
devoted to this field of research have been published 
[28-30,3r]. 

A solid-phase approach [32,33] has been used for the 
stereoselective construction of several different mucin- 
type 0-glycans. Stepwise elongation of the carbohydrate 
led to the required highly glycosylated amino acid build- 
ing blocks, which were then incorporated into a solid- 
phase glycopeptide synthesis. Other branched O-glycans 
have recently been prepared by an efficient one-pot 
glycosylation approach using either giycosyl fluoride 
[34] or thioglycoside [35] building blocks. As sialylated 
derivatives of tumour-associated antigens are also present 
on the surface of cancer cells, the preparation of 0-linked 
sialyl oligosaccharides is important. Paulson and collea- 
gues [36] demonstrated that recombinant sialyltrans- 
ferases are ideal catalysts for the simple and efficient 
preparation of CMinked sialyl oligosaccharides by elonga- 
tion of a synthetic giycosyl amino acid. 

The application of non-natural amino acids in carbohy- 
drate vaccines has also attracted considerable attention 
[37], as these unnatural linkages might give an increased 
immune response. Danishefsky's group [38] investigated 
the synthesis of different giycosyl hydroxynorleucines, 
each containing a tumour-associated carbohydrate anti- 
gen. While the glycosylation of trichloroacetimidate 
donors with the amino acid predominately afforded the 
corresponding a-O-linked product, the reaction with a 
glycal epoxide donor provided the (3-0-linked product. 
The glycal methodology was also successfully applied to 
the synthesis of Lewis Y- and Globo-H-containing amino 
acids. 

More recently, an automated synthesizer has been used to 
accelerate the synthesis of the Lewis y -Lewis x tumour 
marker (Figure 2c) and the Lewis X and Lewis Y blood 
group antigens [39**]. Only five monomers were neces- 
sary for the efficient construction of the three target 
structures. 

GPI anchors 

GPI-anchored proteins are involved in many biological 
and physiological processes and have attracted consider- 
able attention since the first structure determination of a 
GPI in 1988 [40]. These naturally occurring glycolipids 
serve to attach proteins or glycoproteins onto eukaryotic 
cell membranes. All reported GPI structures share the 



basic core structure shown in Figure 3a with a linear 
tetrasaccharide attached to the 6-0-position of inositol. 
Besides this conserved general structure, considerable 
diversity exists within the GPI anchor family based on 
the variation of the substitution pattern on this pseudo- 
pentasaccharide backbone. In most cases, the core is 
further modified by species-specific carbohydrates, addi- 
tional phosphoethanolamine units and variations in the 
lipid moiety. Proteins or glycoproteins are linked to the 
non-reducing end by their C termini or a phosphoetha- 
nolamine group. Owing to the structural complexity of 
the GPI anchors that requires a detailed knowledge of 
lipid, phosphate and oligosaccharide chemistry, many 
chemists have focused on the synthesis of these motifs 
[41']. 

A linear solution-phase approach allows for the construc- 
tion of complex GPI anchors and for the preparation of an 
orthogonally protected derivative of the phosphorylated 
pseudo-pentasaccharide core [42]. Another variable con- 
cept for the preparation of branched GPIs was developed 
by Pekari and Schmidt [43]. The efficiency of this 
approach was demonstrated by the synthesis of the 
GPI anchors of rat brain Thy-1 and scrapie prion protein 
in their water-soluble and lipidated forms. This approach 
also allows further attachment of peptide residues or 
biological markers to the GPI anchor. Reichardt and 
Martin-Lomas [44] reported a soluble support-based 
approach for the synthesis of the GPI backbone. This 
method, using a polyethylene glycol-grafted polystyrene 
resin functionalized with a Wang-chloride linker, can be 
applied to the preparation of a small library of GPI 
precursors. 

GD52 antigens, simple GPI-anchored glycopeptides, are 
present on eukaryotic cells and play an important role in 
the human immune system. Initial studies aimed at the 
synthesis of sperm CD52, including the preparation of an 
acylated inositol [45] and the linkage to the peptide [46], 
were performed by Guo and colleagues. More recently, 
they reported [47**] the first synthesis of a skeleton 
structure of sperm CD52. In their strategy the glycopep- 
tide and the GPI anchor were prepared separately and 
subsequently linked by an amide bond to give the gly- 
copeptide-GPI conjugate (Figure 3b). 

Synthetic GPIs are promising vaccine candidates against 
malaria, as shown in a mouse model [48]. Annually, 
malaria infects 5-10% of the world's population and kills 
about 3 million people each year. The malaria parasite 
Plasmodium falciparum expresses a large amount of GPI 
anchored to a protein, and the GPI structure (Figure 3c) 
has been identified as the malaria toxin. A solution-phase 
synthesis of two malaria vaccine candidates with a 
pseudo-hexasaccharide backbone has recently been 
reported by Seeberger and collagues [49]. This strategy 
allows for scale-up to procure compounds for preclinical 
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Figure 3 
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GPI anchors, (a) Basic core structure of all GPI anchors, (b) Skeleton structure of sperm CD52. (c) Structure of the malaria GPI vaccine 
candidate. 


and clinical trials. The authors also demonstrated that the 
synthesis of this target can be automated effectively 
[50"]. The fully protected oligosaccharide was obtained 


in only 9 h starting from four monosaccharides and one 
disaccharide building block. Fraser-Reid and coworkers 
[51,52**] developed a method for the solution-phase 


Current Opinion In Biotechnology 2004, 15:615-622 


wwv.sciencedirect.com 



Carbohydrate diversity Holemann and Seeberger 621 



synthesis of a fully lipidated and phosphorylated malarial 
GPI pseudo-pencasaccharide using orthoesters and 
methyl a-D-glucopyranoside as the key building blocks. 

Conclusions 

Innovative synthetic methods are an important tool to 
create diverse carbohydrates. Recent advances in the 
preparation of complex oligosaccharides as well as entire 
glycoproteins containing N-glycans, 0-glycans and GPI 
anchors have been highlighted in this review. Highly 
branched carbohydrates and biologically relevant oligo- 
saccharides are now accessible via these methods, provid- 
ing sufficient quantities for biological studies. The 
availability of defined synthetic glycoproteins and glyco- 
lipids will significantly support biological investigations. 
The development of new strategies for the preparation of 
carbohydrates is fundamental for the understanding of 
carbohydrate-protein interactions, biosynthetic pathways 
and structure-activity relationships and will allow for the 
discovery of new targets for therapeutics, diagnostics and 
vaccines. The introduction of an automated oligosacchar- 
ide synthesizer has greatly accelerated access to many 
highly branched carbohydrates, and a series of biolog- 
ically relevant oligosaccharides has been efficiently pre- 
pared on this machine. Further improvement and 
extension of this technology could allow for the auto- 
mated synthesis of complex glycoproteins, proteoglycans 
and glycolipids using only one instrument and will even- 
tually enable even non-specialists to create biologically 
important compounds for biochemical, biophysical and 
medicinal applications. 
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